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ABSTRACT 

The fitting of tlie observed redsliifts and magnitudes of type la supernovae to 
wliat we would see in liomogeneous cosmological models has led to constraints 
on cosmological parameters. However, in doing such fits it is assumed that the 
sampled supernovae are moving with the Hubble flow, i.e. that their peculiar 
velocities are zero. In reality, peculiar velocities will modify supernova data 
in a way that can impact best-fit cosmological parameters. We theoretically 
quantify this effect in the nonlinear regime with a Monte-Carlo analysis, using 
data from semi-analytic galaxy catalogs that are built from the Millennium N- 
body simulation. We find scaling relations for the errors in best-fit parameters 
resulting solely from peculiar velocities, as a function of the total number of 
sources in a supernova survey and its maximum redshift Zmax- For low redshift 
surveys, we find that these errors can be of the same order of magnitude as 
the errors due to an intrinsic magnitude scatter of 0.1 mag. For a survey with 

= 2000 and 2;niax = 1-7, we estimate that the expected peculiar velocity- 
induced errors in the best-fit cosmological constant density and equation of state 
can be cta ~ 0.009 and ~ 0.01, respectively, which are subdominant to the 
errors due to the intrinsic scatter. We further find that throwing away supernova 
data below a redshift z ~ 0.01 — 0.02 can reduce the combined error, due to 
peculiar velocities and the intrinsic scatter, but by only about 10%. 

Subject headings: cosmological parameters — cosmology: observations — cos- 
mology: theory — supernovae: general 



1. Introduction 



Type la supernovae (SNe) at a given redshift appear to be dimmer than one would expect 
in a homogeneous Friedmann-Robertson- Walker (FRW) Universe containing only pressure- 
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verse is accelerating. Furthermore, if we restrict ourselves to fitting the observed redshifts 
and magnitudes of type la SNe to the predictions of FRW models, we find that the data 
appear to be best fit by a fiat model with a matter density f^M ~ 0.25 and a cosmological 
constant density Qa ~ 0.75. In this model, the Universe is presently undergoing accelerated 
expansion because of the current phase of dark energy domination. 

There have been recent claims that we may not be justified in fitting SN data with FRW 
models, and thus doing so may have led to an incorrect assessment of the composition and 
behavior of the Universe; for a review, see Celerier (2007) and references therein. Of course, 
even if the Universe is homogeneous on large scales, local large scale structure does undoubt- 
edly perturb the redshifts and apparent magnitudes of type la SNe. So the question at hand 
is: Can such perturbations have a non-negligible impact on the inferences that we draw from 
these data? It has been suggested that the answe r could be "yes", with the largest effects 
coming from peculiar velocities and weak lensing ( Hui fc Greene 20od ; Cooray &: Caldwell 



20061 ). We will focus here on peculiar velocities, whereas Sarkar et al. (2007) performs a 
complementary analysis with the focus on lensing. Although it is really the total combined 
effect of inhomogeneity that is gauge invariant and observable, we are nonetheless allowed 
to look at the peculiar velocity effect alone here in the Newtonian regime. 

The goal of this paper is to theoretically quantify the parameter errors that result from 
the peculiar velocities of type la SNe in the most realistic framework possible. We will thus 
make use of N-body simulation data, making this the first theoretical study of this effect 
that robustly takes into account not only correlated bulk flows, but also fully nonlinear 
(Newtonian) structure formation, both of which are thought to enhance the effect. 

We will find that our results are in accordance with recent estimates from actual SN 
data. A key benefit of our theoretical approach, however, is that we can now estimate the 
size of the effect that we will expect for future SN surveys, such as SNAP for which we find 
the peculiar velocity-induced error in a constant dark energy equation of state to be at the 
1% level. This is subdominant to the ~ 5% error, that we would find with the same survey 
and model fitting, due to a 0.1 mag intrinsic scatter. We will also address the question of 
whether it is beneficial to throw away very low redshift {z < 0.02) SN data when fitting to 
a model, as such data points will be the most affected by these errors; we find that such 
a practice could lead to error reductions of 10% at most, for low redshift surveys. These 
issues are of great importance at a time when the dark energy problem is at the forefront of 
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modern science, and when we are planning ground and space-based facilities to gather SN 
data. 



2. Peculiar Velocities 

Now we will briefly review the effect of peculiar velocities on type la SN data, wherein 
we measure SN redshifts z and luminosity distances to produce a "Hubble diagram." 
To lowest order, peculiar velocities add an extra redshift. For an observer with a peculiar 
velocity Vq and a source with a peculiar velocity v^, an unperturbed redshift 5, and an 
unperturbed luminosity distance i)^, the final (perturbed) redshift is 

2; = 5 + (l + 5)n-(v, -v„) (1) 

and the final luminosity distance is 

Di^{z) = (1 + 2n ■ V, - n ■ v„) ^^(5) , (2) 

where n is a unit vector along the line of sight, pointing from observer to source; these are 
equivalent to Eqs. (11) and (13) of Hui & Greene (2006). Note that we have set the speed 
of light c = 1. Although it is typically assumed that these corrections are negligible, they 
actually can become quite important for very nearby SNe, for which z < (10 — 100)f /c, where 
V = |v|. Having one end of the Hubble diagram with such errors will have more significant 
consequences than one might naively expect when it comes time to fit the data to a model. 

A further complication arises because the peculiar velocities are correlated, and corre- 
lated errors will not decrease as fast as Vn, where is the number of SNe in the sample. 
From Cooray & Caldwell (2006), in the limit of large A^, the final variance of some measured 
parameter will be 



a ^ an 



,2 



^ + ^y. (3) 



where is the ratio of the average off-diagonal covariance matrix element to the average 
diagonal covariance matrix element, and both ctq and r depend on the maximum survey 
redshift Zmax- Note that < < 1, with r corresponding to no correlation and r ^ 1 
corresponding to perfect correlation. Therefore, the first term on the right hand side of 
equation ([3]) corresponds to the uncorrelated Poissonian part of the error, and the second 
term corresponds to the correlated part. We also see that the error cannot be arbitrarily 
reduced simply by having larger A^, as there is a limit of cr — >^ Uor for A^ — > oo. 
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Method 



Although recent studies with real SN data (iNeill et al.l 120071 : iGordon et al.l 120071 ) have 
pointed to peculiar velocities being an important effect, it is useful to quantify this in a very 
controlled theoretical framework. For that purpose, we use publicly available semi-analytic 



galaxy catalogs cr eated at MPA (ICroton et al.ll2006l ). which make use of the Millennium N- 



body Simulation ( Springel et al. 20051 ). The Millennium Simulation followed the evolution 



of ~ 10^° dark matter particles, each with a mass 8.6 x lO^/i^^M©, in a periodic box of size 
500/ h Mpc. This was carried out within the framework of the standard ACDM cosmological 
model. We show a table of the parameters of the simulation and the resulting data in Tabled! 
After semi-analytic galaxy modeling, the resulting data file contains the present-day, z = 0, 
positions and velocities of 9, 925, 229 galaxies in the simulation box. 

We use these data to find the size of the typical errors that one should expect as a 
result of peculiar velocities. To this end, we produce mock SN catalogs by sampling their 
host galaxies randomly from the 9, 925, 229 galaxies in the semi-analytic data. We create 
these catalogs for varying values for the number of SNe and the maximum catalog redshift 
Zmax- With this method we not only explore the effect of sample size, but we also explore 
the effect of varying redshift coverage. Extending this coverage to higher z^nx should reduce 
the resulting errors. Reducing the low-redshift coverage, by throwing away data below a 
minimum redshift -Zmin, also reduces errors by excluding more of the SNe for which the 
peculiar velocity perturbation is large. However, increasing 2;min also means a reduced A^, 
which will increase errors. A key issue then is whether or not it is advantageous to impose 
such a lower redshift cutoff. 

Note that we will be limited by the finite size Xtotai = 500 /h Mpc of the simulation 
box, which implies z^^x ^ 0.1725 for an observer placed at a corner of the box. However, 
we can simulate surveys that go out to higher z^g,^ by assuming that high redshift SNe are 
negligibly correlated, and thus that their velocities can be randomly drawn from a Gaussian 
distribution. In this way, we will compute the errors expected in a survey like SNAP, with 
A^ = 2000 and ^^ax = 1-7. 

For each choice of survey parameters, we randomly create 10, 000 independent mock 
SN catalogs. For each catalog, we also choose a random corner in which the observer will 
reside so as to reduce the effects of cosmic variance. We place observers at corners in order 
to maximize the possible redshift coverage of our mock surveys, and this has the implication 
that our surveys all cover one octant of the sky, or ~ 5, 000 square degrees. Then, for each 
SN, we find its comoving distance x from the observer using our knowledge of the position of 
the observer and of the SN's host galaxy. The unperturbed redshift z is found by numerically 
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inverting the function 



dz' c r dz' 



(4) 



where h = Hq/{100 km/s/Mpc) = 0.73, Qm = 0.25, and = 0.75 arc the values used in the 
underlying N-body simulation. The unperturbed luminosity distance is then = (1 + z)x 
and the unperturbed distance modulus is 



Next, the unperturbed rcdshifts and distance moduli arc perturbed to linear order in the 
host galaxy peculiar velocities given in the semi-analytic data, while we assume that the 
data have already been corrected for the (known) peculiar velocities of the observers. Note 
that we are justified in perturbing linearly in velocity, as {v/cY <^ v/c for all galaxies. 
Thus, for a SN with unperturbed data (S^, fhi) and a peculiar velocity along the line of sight 
I'j, the perturbed redshift is Zi — + (1 + Zi)vi and the perturbed distance modulus is 
rrii — rhi-\- lOt'j/ In(lO). 

Finally, the perturbed data [zi, rrii) are fit to FRW models with a standard least-squares 
minimization. There are three different models that we consider, which are all simplifications 
of a general model that has four parameters: the rescaled Hubble constant h, the matter 
density today Qm, the dark energy density today Q^dei and the dark energy equation of 
state w. Note that for actual SN observations one must fit the data for the calibration 
magnitude M; for our purposes here we will ignore this complication. In this model, the 
Hubble parameter as a function of redshift is 



where Hq — /i(100 km/s/Mpc). The three simphfied FRW models to which we fit are: 

(i) a fiat model (Qm + ^de — 1) with a cosmological constant {w — —1), where we fit to 
find h and Qj^f, 

(ii) a curved model with a cosmological constant where we fit to find /i, Qm, and JIa = ^de-i 




(5) 



1 1/2 

+ {l-nM-^de){l+zf 



(6) 



and 



(iii) a fiat model where we fit to find /i, VLm, and w. 
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These models are summarized in Table [2l For each model, we look at the statistical distribu- 
tions of the best-fit parameters of our 10, 000 realizations. We find that the distributions are 
roughly Gaussian with means that approximately equal the underlying simulation values. 
We concern ourselves with the standard deviations, which are representative of the errors to 
be expected from peculiar velocities in each scenario. 

To check our results, we will use two auxiliary methods for estimating the parameter 
errors. The first method involves generating "synthetic" survey data, wherein we do not use 
the N-body data but instead we choose the velocities at random from a Gaussian distribution 
with the known variance (f^). Then, proceeding as before, we will find the contribution to 
the error that is due solely to uncorrelated nois e, i.e. we will find th e first term in equation 



Secondly, we can estimate Fisher matrices (IHui &: Greenell2006l ) 



aa • 



where Cij = {6mi6mj), pa = (/i, ^a/ , etc.), and the error in parameter pa is cTq, = >/[F~^ 
The Fisher matrices provide a useful check of our code, and they also give us an analytic 
understanding of the scaling of the uncorrelated errors with and z^^.^ for each case. We will 
then fit the final variances as cr^otai — ^pois + "^corr^ where cipois is the uncorrelated component 
of the error and dcorr is the correlated component. We expect that (Xpois scales as 
and both cXpois and cTcorr are power laws in 2;max for -Zmax ^ 1- Hence, we expect errors of the 
form cr^otai ~ 

^Zm■^.JN + Bzl^.^^ for small z^ax; this expectation comes from the findings of 
Vanderveld et al. (2007), Section VB. 



4. Results and Discussion 

For model (i), with ^min = 0, we show some of our results in Table [31 Here ah is the 
standard deviation of the errors in h and ctm is the corresponding error in VLm-, where we are 
only considering the errors due to peculiar velocities. These results are best fit by 

al ^ 2.5 X 10-^^ + 2.6 x l()-''z^l^ (8) 

and 

0.0011^ + 5.8 xlO-^i^^L', (9) 

where the first term in each of these is the uncorrelated piece that matches what we find 
using results from our synthetic surveys. This also matches what we expect from the Fisher 
matrix for this fit. Furthermore, for z.^g,y^ = 0.1725, we find that r ^ 0.025. We find that 
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this value of r is consistent with the statistics of the galaxy velocities in the simulation box, 
where we estimated using 5, 000 random galaxies, 



J2i^iViVj /0.03125- 0.00664 „ , , 

^ ~ / ~ 0.027 . (10) 



{N-l)^^vf V 4,999 x 0.00664 

One further issue is whether or not it is advantageous to throw away low redshift data 
points. To explore this, we add a random intrinsic scatter of 0.1 mag to the SN magnitudes, 
in addition to the peculiar velocity error. We then throw away all data points that have a 
final (observed) redshift below a cutoff Zmin- The resulting errors are shown in Table HI for 
= 500 and ^max = 0.1725. We thus find that the optimal minimum redshift is 2;min ~ 0.02, 
for which we find a 7% reduction in total error. We also find, for these survey parameters, 
that the error due to peculiar velocities is the same order of magnitude as the error due to 
the intrinsic scatter alone. 

We give some of the results of fit (ii), for z^i^ = 0, in Tabled where a a is the error in 
^A- These errors scale as 

al ^ 0.0035^ + 2.8 x lO'^^-^.^ , (11) 

-6 



0.43^ + 6.1 xlO-^^-L, (12) 



and 



^2 ^0.16^ + 0.0012^^^^ (13) 

This time we find r = 0.036 when z^ax = 0.1725, which is still consistent with our prior 
rough estimate from galaxy statistics. By adding an intrinsic magnitude scatter of 0.1 mag 
to a survey with = 500 and Zmax = 0.1725, we now find the optimal minimum redshift 
to be z^ain = 0.01, this time yielding a total error reduction of 9%. We find once again that 
the error due to peculiar velocities is of the same order of magnitude as the error due to this 
scatter alone. 

Riess et al. (1998) fit to model (ii), for a survey that has a low-redshift sample of 34 SNe 
with z < 0.15 and a high-redshift sample of 15 SNe with 0.16 < z < 0.62 (one SN with z = 
0.97 was included in some of their analysis). We (roughly) simulate this scenario by assuming 
that the SNe in the high-redshift sample are uniformly distributed and negligibly correlated, 
and thus their velocities can be randomly drawn from a Gaussian distribution. With this 
method, we estimate the errors for Riess et al. (1998) to be (cr/i, ctm, cta) ~ (0.009,0.7,0.6), 
which are about a factor of 3 smaller than the quoted total errors. For a survey like t hat of 



Neill et al. (2007), using data from the Supernova Legacy Survey (lAstier et al.ll2006l ) with 
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44 SNe with 0.015 < z < 0.125 and 71 SNe with 0.25 < z < 1, we find that the expected 
errors are (o"/j, 0"^/, cta) ~ (0.003,0.07,0.07), in accordance with the finding Af2A = —0.04 in 
that paper. Furthermore, for a SNAP-hke survey with 500 SNe with z < 0.17 and 1500 SNe 
with 0.17 < z < 1.7, we find {ah,aM,(rA) ~ (0.0008,0.005,0.009). 

For model (iii), proceeding as before, we give some of our results in Tableland we find 
the following scaling relations reproduce the standard deviations when Zmin = 0: 

^2 ^ 0.0041^ + 4.9 xlO-«z-i;f, (14) 

al, ^ 0.040^ + 3.0 X 10"^;^-^^ , (15) 

and 

^2 ^Q^^2^^3 9^^Q-4^-5^r ^ (16) 

where is the error in w. We now find the optimal minimum redshift for this fit to be 
Zmin = 0.01, with a total error reduction of 11%. Furthermore, for the aforementioned 
SNAP-like survey, we estimate the peculiar velocity-induced error to be {crh,o'M,o'w) ~ 
(0.001,0.002,0.01). 

This error should be compared with the other sources of error in such a survey, such 
as the intrinsic SN magnitude spread and gravitational lensing. Still for model (iii), we find 
that an intrinsic spread of 0.1 mag produces the errors (a^, a\^, cr^) ~ (0.003, 0.01, 0.05), and 
from Sarkar et al. (2007) we see that gravitational lensing produces a corresponding error 
of al ^ 0.006. This means that the noise caused by peculiar velocities is greater than that 
caused by lensing, but smaller than that caused by a 0.1 mag intrinsic scatter. Therefore 
total combined effect is dominated by the intrinsic scatter: a^^^^^ ^ aj^, ~ 0.05. 

We can compare our results to those of Hui & Greene (2006), who perform a linear- 
order analysis, to find that nonlinear corrections appear to be relatively unimportant, as one 
might expect. For example, from Fig. 4 of Hui & Greene (2006), which shows results for a 
similar SNAP-like survey that incorporates a low redshift sample, we see that the additional 
error on w due to peculiar velocities is of order ~ 0.01 — 0.02. Keeping in mind that 
our survey structures and model fits differ somewhat, this still shows that there is not a 
significant difference in the rough size of our results. This is not to say, however, that the 
correlated nature of large scale structure does not play a crucial role in determining these 
errors, also as emphasized by Hui & Greene (2006). 

We now address some of the assumptions of the above analysis. First, we considered 
only the case with a sky coverage of approximately 5, 000 square degrees. This should not 
matter much, as Hui & Greene (2006) found that the area and geometry of a survey do not 
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have a significant impact as long as the area is large enough and the geometry does not have 
any unusual features. Thus, our results should be representative of surveys with large areas 
for lo w redshifts; this would be true for any survey that incorporates the Nearby SN Factory 
data (jWood-Vassey et al.ll2004l ). We also chose the galaxies at random within the simulation 
box, when in reality there are selection effects that modify the redshift distribution. In our 
analysis, the total number of SNe as a function of redshift increased like z^, where in reality 
this rises faster for low z and then eventually drops off at 2; ~ 1. This means that a real 
survey would have a larger fraction of its SNe at lower redshifts, which would amplify the 
peculiar velocity error computed here. 

Furthermore, we ignored the velocities of SNe with respect to their host galaxies. Such 
internal velocities are typically not larger th an the host galaxy velocities and they are not 
correlated between SNe in different galaxies (IHui fc Greenell2006l ). meaning that they would 
only amplify somewhat the Poissonian pieces of the errors quoted above. Another assumption 
in our analysis is that the Universe is roughly homogeneous on the scale of the simulation 
box, and thus we did not address what happens if there are very large scale bulk flows. 

For each model, we found that excluding data points below a redshift of ^min ~ 0.01—0.02 
can improve the situation slightly, by reducing the total error by as much as ~ 10%. A far 
better alternative to cutting data could be to use local flow models to try to estimate, and 
subsequently correct for, peculiar velocities of local sources ( iNeill et al.l 120071 ). An analysis 
of the potential beneflts of such methods is the subject of future work. 
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Table 1. Parameters of the MPA semi- analytic galaxy catalogs produced from the 

Millennium N-body simulation. 



Parameter Definition 


Value 




total matter density today 


0.25 




baryon density today 


0.045 


Qa 


dark energy density today 


0.75 


w 


dark energy equation of state 


-1 


n 


initial power spectrum slope 


1 




perturbation amplitude today 


0.9 


A'total 


number of galaxies 


9,925,229 


3^total 


simulation box size 


500/h Mpc 




> ])eculiar velocity variaiice 


1.271 X IQ-^ 


Table 2. Summary of the three models for our fits. 


Model 


h ^de W 




(i) 


Varied Varied 1-^^^/ —1 




(ii) 


Varied Varied Varied —1 




(iii) 


Varied Varied 1-0m Varied 




Table 3. Parameter errors for model (i) with ^min = 0. 




N (Th (Tm 





0.1725 500 
750 
1000 
0.15 500 
0.13 



0.0043 
0.0038 
0.0032 
0.0061 
0.0082 



0.057 
0.049 
0.043 
0.093 
0.15 
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Table 4. The ^^min dependence for model (i), with = 500 and ^^max = 0.1725, and an 

added intrinsic scatter of 0.1 mag. 



^min CT/i O M 



0.0073 0.099 

0.01 0.0069 0.094 

0.02 0.0068 0.093 

0.03 0.0070 0.095 



Table 5. Parameter errors for model (ii) with z-^i^ — 0. 





N 








0.1725 


500 


0.019 


7.4 


4.6 




750 


0.018 


6.9 


4.3 




1000 


0.016 


6.4 


4.0 


0.15 


500 


0.023 


12 


7.1 


0.13 




0.026 


18 


10 



Table 6. Parameter errors for model (iii) with z-^^^ = 0. 





N 








0.1725 


500 


0.021 


2.6 


4.4 




750 


0.018 


2.3 


3.9 




1000 


0.016 


2.1 


3.6 


0.15 


500 


0.023 


3.7 


6.1 


0.13 




0.027 


6.0 


9.6 



